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Sinusoidal coding 



The invention relates to encoding a signal, in which frequency and ampUtude 
information of at least one sinusoidal component are determined and sinusoidal parameters 
representing the frequency and amplitude information are transmiti;ed. 

5 US-A 5,664,051 discloses a speech decoder apparatus for synthesizing a 

speech signal from a digitized speech bit-stream of the type produc;ed by processing speech 
Avith a speech encoder. The apparatus includes an analyzer for proc^essing the digitized speech 
bit stream to generate an angular frequency and magnitude for eacli of a plurality of 
sinusoidal components representing the speech processed by the sf »eech encoder, the analyzer 

1 0 generating the angular frequencies and magnitudes over a sequencis of times; a random signal 
generator for generating a time sequence of random phase components; a phase synthesizer 
for generating a time sequence of synthesized phases for at least some of the sinusoidal 
components, the synthesized phases being generated from the angular frequencies and 
random phase components; and a synthesizer for synthesizing spee:ch from the time 

1 5 sequences of angular frequencies, magnitudes and synthesized phases. This document 

discloses that a great improvement in the quality of synthesized speech can be achieved by 
not encoding the phase of harmonics in voiced (i.e., composed primarily of harmonics) 
portions of the speech, and instead synthesizing an artificial phase for the harmonics at the 
receiver. By not encoding this harmonic phase information, the bits that would have been 

20 consumed in representing the phase are available for improving the quality of the other 
components of the encoded speech (e.g. pitch, harmonic magnitudes). In synthesizing the 
artificial phase, the phase and frequencies of the harmonics within the segments are taken 
into account. In addition, a random phase component, or jitter, is added to introduce 
randomness in the phase. More jitter is used for speech segments in which a greater fraction 

25 of the frequency bands are unvoiced. The random jitter improves the quality of the 

synthesized speech, avoiding the buzzy, artificial quaUty that can lesult when phase is 
artificially synthesized. 



1-2000; 





PO - DG 1 



DESC 



1 



20. 06. 2000 19.06.2000 




20-06-2000 : >HNL000332EPP • |EP00202144.2 





2 19.06.2000 

An object of the invention is to provide advantageous coding. To this end, the 
invention provides a method of encoding a signal, a method of decoding an encoded signal, 
an audio coder, an audio player, an audio system, an encoded signsil and a storage medium as 
defined in the independent claims. Advantageous embodiments are; defined in the dependent 
5 claims. The invention provides an advantageous way of applying phase jitter by transmitting 
a phase jitter parameter from the encoder to the decoder to indicate the amount of phase jitter 
that should be applied in the decoder during synthesis. Sending a phase jitter parameter has, 
inter alia, the advantage that a relation between the amount of phase jitter applied in the 
decoder and the original signal is established. In this way, more natural sound of a 

10 reconstructed audio signal is obtained, which better corresponds to the original audio signal. 
Further, the amount of phase jitter to be applied can be determined faster and more reliable, 
because it is not necessary to determine locally in the decoder the £imoxmt of phase jitter to be 
applied to generate a natural sounding signal. 

By including the phase jitter parameter in the encoded bit-stream, the bit-rate 

15 is increased. However, the increase bit-rate can be minimal since tliese phase jitter 

parameters can have a very low update-rate, e.g. once per track. A track is a sinusoidal 
component with a given frequency and amplitude, i.e. a complete siet of sinusoid segments. 
Preferably, the phase jitter parameter is transmitted approximately together with the 
frequency and the amplitude of the sinusoid at a first instance of a track. In that case, all 

20 required information is available at an early stage in the decoding. 

An alternative solution to this problem would be to transmit the original phase, 
or phase differences at various time instances such that the frequen cy can be adapted during 
synthesis to match this original phase at the respective time instances. Sending these original 
phase parameters result in a better quality but requires a higher bit- rate. 

25 In a preferred embodiment, it is assimied that phase -jitter applied to 

harmonically related frequencies bears the same harmonic relation as the related frequencies. 
It than suffices to transmit one phase jitter parameter per group of liarmonically related 
frequencies. 

The phase jitter parameters are preferably derived fi om statistical deviations 
30 measured in the original phase. In a preferred embodiment, a difference between aa original 
phase of the signal aad a predicted phase is determined, which predicted phase is calculated 
from the transmitted frequency parameters and a phase continuation requirement, and the 
phase jitter parameter is derived from said difference. With continuous phase, only a first 
instance of a sinusoid in each track may include a phase parameter, consecutive segments of 
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the sinusoid must match, i.e. calculate, their phase parameters in svich a way that they align 
with the phase of the current sinusoid segment. Reconstructed phasies based on a continuous 
phase criterion lost their relation to original phases. As explained i]i the prior art, 
reconstructed signals with a constant frequency and amplitude in conjimction with 
continuous phases, sound somewhat artificial. 

In general, it is not required that the phase jitter parEimeters indicate an exact 
amoimt of phase jitter. The decoder may perform a certain predeteimined calculation based 
on the value of the phase jitter parameter and/or characteristics of the signal. 

In an extreme case, the phase jitter parameter consists of one bit only. In this 
case, e.g. a zero indicates that no phase jitter should be applied and a one indicates that phase 
jitter should be applied. The phase jitter to be applied in the decodta: may be a predetermined 
amoxmt or may be derived in a pre-detemiined manner from charac;teristics of the signal. 

The aforementioned and other aspects of the invention will be apparent from 
and elucidated with reference to the embodiments described hereinafter. 

In the drawings: 

Fig. 1 shows an illustrative embodiment comprising an audio coder according 
to the invention; 

Fig. 2 shows an illustrative embodiment comprising an audio player according 

to the invention; and 

Fig. 3 shows an illustrative embodiment of an audio system according to the 

invention. 

The drawings only show those elements that are necessary to understand the 

invention. 

The invention is preferably applied in a general sinusoidal coding scheme, not 
only in speech coding schemes, but also in sinusoidal audio coding schemes. In a sinusoidal 
coding scheme, an audio signal to be encoded is represented by a plurality of sinusoids of 
which a frequency and an amplitude are determined in an encoder. Often, the phase is not 
transmitted, but the synthesis is performed in such a way that the phase between two 
subsequent segments is continuous. This is done to save bit-rate. In a typical sinusoidal 
coding scheme sinusoidal parameters for a number of sinusoidal components are extracted. 
The sinusoidal parameter set for one component at least consists oJ'a frequency and an 
amplitude. More sophisticated coding schemes also extract information on the course of the 
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frequency and/or amplitude as a function of time. In the simplest c;ise, the frequency and 
amplitude are assximed to be constant within a certain amount of time. This time is denoted as 
the update interval and typically ranges from 5ms - 40 ms. During synthesis, the frequencies 
and amplitudes of consecutive frames have to be connected. A tracking algorithm can be 
5 applied to identify frequency tracks. Based on this information, a continuous phase can be 
calculated such that the sinusoidal components corresponding to a single track properly 
connect. This is important because it prevents phase discontinuities, w^hich are almost always 
audible. Since the frequencies are constant over each update interval, the continuously 
reconstructed phase has lost its relation to the original phase. 

1 0 Fig. 1 shows an exemplary audio coder 2 according to the invention. An audio 

signal A is obtained from an audio source 1 , such as a microphone, a storage medium, a 
network etc. The audio signal A is input to the audio coder 2. A sinusoidal component in the 
audio signal A is parametrically modeled in the audio coder 2. A coding imit 20 derives from 
the audio signal A^ a frequency parameter / and an amplitude parameter a of at least one 

15 sinusoidal component. These sinusoidal parameters / and a are included in an encoded audio 
signal A ' in multiplexer 2 1 . The audio stream A ' is furnished from the audio coder to an 
audio player over a communication channel 3, which may be a wireless coimection, a data 
bus or a storage medium, etc. At the encoder, a sinusoidal track is identified. This means that 
at two time instants and the frequencies and phase are known. From the frequency track 

20 and phase at the phase at t2 can be predicted. This is preferably done in a same way as in a 
decoder. The error of the prediction of the phase at t2 and the actual measured phase can be 
calculated. A characteristic value of this error, e.g. mean absolute ^^alue or a variance, can be 
determined. Preferably, the phase jitter parameter is derived from this characteristic value. In 
this way, the required phase jitter is determined in the encoder, by calculating the difference 

25 between the actual phase and the phase determined from the sinusoidal parameters in the 
encoder. A phase jitter parameter derived from this difference is tnmsmitted to the decoder 
which uses the phase jitter parameter to introduce a derived amount of phase jitter by 
changing slightly the phase of the corresponding signal in the syntlaesis. 

An alternative way of determining the phase jitter p.arameter is to monitor 

30 fluctuations in the original frequency. 

An embodiment comprising an audio player 4 according to the invention is 
shown in Fig. 2. An audio signal A * is obtained from the conunimication channel 3 and de- 
multiplexed in de-multiplexer 40 to obtain the sinusoidal parameters / and a and the phase 
jitter parameter p that are included in the encoded audio signal A \ These parameters/ a and 
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p are furnished to a sinusoidal synthesis (SS) unit 41 . In SS unit 41 , a sinusoidal component 
S * is generated which has approximately the same properties as ttie sinusoidal component S in 
the original audio signal A. The sinusoidal component S' is multiplexed together with other 
reconstructed components and output to an output unit 5, which may be a loudspeaker. At the 
5 decoder, the phase jitter parameter p is available. Next to determining the phase of the signal 
at each instant by using phase continuation and some way of frequisncy (and thus phase) 
interpolation, the phase jitter parameter is used to add a disturbance to the constructed phase 
interpolation. This new phase is then treated as 'original phase', to the extent that the 
frequencies are adjusted during synthesis to match these new phas<5 values. 

10 Fig. 3 shows an audio system according to the invention comprising an audio 

coder 2 as shown in Fig. 1 and an audio player 4 as shown in Fig. 2. Such a system offers 
playing and recording features. The commimication channel 3 ma> be part of the audio 
system, but will often be outside the audio system. In case the communication channel 3 is a 
storage medium, the storage medium may be fixed in the system oi- may also be a removable 

1 5 disc, tape, memory stick etc. 

It should be noted that the above-mentioned embodiments illustrate rather than 
hmit the invention, and that those skilled in the art will be able to design many altemative 
embodiments without departing from the scope of the appended cLiims. In the claims, any 
reference signs placed between parentheses shall not be constmed as limiting the claim. The 

20 word ^comprising' does not exclude the presence of other elements; or steps than those listed 
in a claim. The invention can be implemented by means of hardware comprising several 
distinct elements, and by means of a suitably programmed computer. In a device claim 
enumerating several means, several of these means can be embodit^d by one and the same 
item of hardware. The mere fact that certain measures are recited in mutually different 

25 dependent claims does not indicate that a combination of these me^isures cannot be used to 
advantage. 

In summary, encoding a signal is provided, wherein frequency and ampUtude 
information of at least one sinusoidal component in the signal is determined, and sinxisoidal 
parameters representing the frequency and amplitude information jure transmitted, and 
30 wherein further a phase jitter parameter is transmitted, which represents an amount of phase 
jitter that should be added during restoring the sinusoidal component from the transmitted 
sinusoidal parameters. 
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CLAIMS: 



1 . A method of encoding (2) a signal (A), the method comprising the steps of: 

determining (20) frequency and ampUtude infomiati on of at least one 

sinusoidal component in the signal (A); and 

transmitting (22) sinusoidal parameters (f,a) representing the frequency and 
5 amplitude information; 

characterized in that the method (2) further compris<2s the step of: 
transmitting (22) a phase jitter parameter (p) representing an amount of phase 

jitter that should be added dviring restoring the sinusoidal component from the transmitted 

sinxisoidal parameters (f^a). 

10 

2. A method (2) as claimed in claim 1, wherein the phase jitter parameter (p) is 
transmitted (22) approximately together with the sinusoidal param(5ters (f,a) at a first instance 
of a track. 

15 3. A method (2) as claimed in claim 1 , wherein a phas<5 jitter parameter (p) is 

transmitted for a given group of sinusoidal components, which sinusoidal components have 
harmonically related frequencies. 

4. A method (2) as claimed in claim 1 , the method (2) ftirther comprising the 
20 steps of: 

determining (20) a difference between a phase of the sinusoidal component 
and a predicted phase, which predicted phase is calculated from th<5 transmitted sinusoidal 
parameters (f,a) and a phase continuation requirement; and 

deriving (20) the phase jitter parameter (p) from said difference. 

25 

5. A method of decoding (4) an encoded signal (A*), tlie method comprising the 
steps of: 

receiving (40) sinusoidal parameters (f,a) representing frequency and 
amplitude information of at least one sinusoidal component; 
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restoring (41) the at least one sinusoidal component from the sinusoidal 
parameters (f,a); 

characterized in that the method further comprises: 
receiving (40) a phase jitter parameter (p); 

adding (41) an amount of phase jitter to the sinusoidal component, which 
amount of phase jitter is derived from the phase jitter parameter. 

6. An audio coder (2) comprising: 

means (20) for determining frequency and amplitude information of at least 
one sinusoidal component in the signal (A); and 

means (22) for transmitting sinusoidal parameters (f ^a) representing the 
frequency and amplitude infomiation; 

characterized in that the audio coder (2) ftirther comprises: 

means (22) for transmitting a phase jitter parameter (p) representing an amount 
of phase jitter that should be added during restoring the sinusoidal component from the 
transmitted sinusoidal parameters (f,a). 

7. An audio player (4) comprising: 

means (40) for receiving sinusoidal parameters (f,a) representing frequency 
and amplitude information of at least one sinusoidal component; 

means (41) for restoring the at least one sinusoidal c omponent from the 
sinusoidal parameters (f,a); 

characterized in that the audio player ftirther comprises: 

means (40) for receiving a phase jitter parameter (p) ; 

means (41) for adding an amount of phase jitter to tlie sinusoidal component, 
which amount of phase jitter is derived from the phase jitter parameter. 

8. An audio system comprising an audio coder (2) as claimed in claim 6 and an 
audio player (4) as claimed in claim 7. 

9. An encoded signal (A') comprising sinusoidal pararaeters (f,a) representing 
frequency and amplitude information of at least one sinusoidal component and ftirther 
comprising a phase jitter parameter (p) representing an amount of ]>hase jitter that should be 
added during restoring the sinusoidal component from the sinusoidal parameters (f,a). 
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is stored. 
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A storage medium (3) on which an encoded signal (A') as claimed in claim 9 
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ABSTRACT: 



Encoding (2) a signal (A) is provided, wherein frequency and amplitude 
information of at least one sinusoidal component in the signal (A) i s determined (20), aad 
sinusoidal parameters (f,a) representing the frequency and amplitude information are 
transmitted (22), and wherein ftirther a phase jitter parameter (p) is transmitted, which 
5 represents an amoimt of phase jitter that should be added during restoring the sinusoidal 
component from the transmitted sinusoidal parameters (f,a). 



Fig. 1 
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